# Industrial image analysis
Internvl3 8B
Apache-2.0
InternVL3 - 8B is an advanced multimodal large - language model with excellent multimodal perception and reasoning capabilities, capable of processing multimodal data such as images and videos.
Multimodal Alignment
Transformers

I
unsloth
224
1
Internvl3 1B GGUF
Apache-2.0
InternVL3 - 1B is an advanced multimodal large language model that excels in multimodal perception, reasoning, and other abilities. It also expands multimodal capabilities such as tool use and GUI agent.
Multimodal Fusion
Transformers

I
unsloth
868
2
Internvl3 38B Hf
Other
InternVL3-38B is an advanced multimodal large language model (MLLM) with significant improvements in multimodal perception and reasoning abilities, supporting areas such as tool use, GUI agents, industrial image analysis, and 3D visual perception.
Image-to-Text
Transformers Other

I
OpenGVLab
2,226
3
Internvl3 14B Hf
Other
InternVL3-14B is a powerful multimodal large language model that excels in multimodal perception and reasoning abilities and supports multiple inputs such as images, texts, and videos.
Image-to-Text
Transformers Other

I
OpenGVLab
4,260
0
Internvl3 8B
Other
InternVL3-8B is an advanced multimodal large language model with excellent multimodal perception and reasoning capabilities, and performs well in multiple fields such as tool use, GUI agents, and industrial image analysis.
Multimodal Fusion
Transformers Other

I
FriendliAI
167
0
Featured Recommended AI Models